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ABSTRACT 


Recent  wcr*  with  Automatic  Speech  Recognition  has 
focusect  on  applications  and  productivity  considerations  in 
the  man-machine  interface.  This  thesis  is  an  attempt  to  see 
if  placing  users  of  such  equipment  under  time-induced  stress 
has  an  effect  on  their  percentage  correct  recognition  rates. 
Suojects  were  given  a  message-handling  task  of  fixed  length 
and  allowed  progressively  shorter  times  to  attempt  tc 
complete  it.  Questionnaire  responses  indicate  stress  levels 
increased  with  decreased  time-allowance;  recognition  rates 
decreased  as  time  was  reduced. 
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I.  INTRODUCTION 


Stress  has  long  been  an  intensely  active  area  of 
investigation.  Pore  recently,  the  intersection  of  physiology 
and  psychology  has  concerned  itself  vith  the  influence  of 
cognitive  and  errotive  factors  on  quantifiable  variable  n 
the  buiran  organism's  equation.  This  research  has  turned 
the  most  fundamental  indices  of  human  reaction  (car  c 
rate,  electrogal vanic  response)  to  more  subtle  measures 
as  various  features  of  the  human  voice. 

Stress,  however  defined,  has  clear  and  important 
implications  for  the  conduct  of  military  operations.  In  the 
warfare  environment,  one  encounters  some  of  the  few  factors 
universally  acknowledged  to  catise  psychological  stress:  it 
is  one  of  the  few  areas  where  one  knows  apriori  that  life- 
threatening  situations  are  consistently  found.  Thus  the 
psychc-physiologicai  correlates  cf  military  action  have 
furnished  both  an  area  of  inquiry  and  the  raw  data  for  those 
interested  in  the  study  oi  the  human  under  stress.  Swedish 
researchers  took  an  early  lead  in  the  investigation  of 
stress  in  the  military  environment.  [Ref.lJ 

The  investigation  of  any  phenomenon  requires  first  and 
foremost  a  statement  delimiting  the  area  cf  research.  Stress 
has  yet  to  receive  a  widely  accepted  definition.  Stress 
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research  tends  to  divide  into  tvc  camps:  these  interested  in 
physiological  stress  and  those  pursuing  the  psychological 
variety.  Both  claim  the  territcrlty  of  psychophysiology. 
Beyond  this,  the  definition  of  stress  breads  down,  in  the 
main,  into  an  examination  of  either  a  stimulus  or  a 
response:  there  appears  to  be  no  common  ground.  Cne  thine 
which  emerges  clearly  from  published  research  in  the  area  is 
that  a  synthetic  conception  of  stress  is  in  order,  cne  broad 
enough  to  cover  the  wide  (ar.d  widening)  area  of  inquiry 
already  embarked  upon,  and  which  would  permit  breakdown  into 
narrower,  more  specialized  areas  of  interest. 

Given  the  vagaries  of  the  published  resources  currently 
available,  it  is  necessary  within  the  context  of  the  present 
paper  to  work  within  an  operati onai iy-d  fined,  problem- 
oriented  view  cf  stress,  from  a  military  commander's  point 
of  view,  an  area  of  great  potential  interest  is  the 
interaction  cf  stress  with  voice-input  to  computer  systems. 
As  miniaturization  and  expanding  technology  have  made 
computers  a  more  widespread  phenomenon  in  the  military 
environment,  the  limiting  factor  (in  the  military  Just  as 
with  civilian  applications )  continues  to  be  the  human 
interface  with  the  ever-  expanding  base  of  information  and 
decision  aids  available.  Voice-input  to  computers  provides 
the  field  user  with  a  higber-order  interface,  at  a  level 
above  even  user-friendly  query  languages.  This  type  cf 
interface  has  the  potential  to  facilitate  the  user's  ability 


to  access  and  manipulate  the  vast  resources  new  becoming 
available  to  him:  it  is  the  next  logical  step  in  the 
evolution  of  man-machine  con  patabi ii  ty .  Computers  are  no* 
able  to  handle  problems  which  not  long  ago  would  have 
absorbed  an  enormous  amount  of  man-hours,  but  the  man  is 
still  "in  the  loop". 

...though  we  hail  the  advent  of  the  postindustrial 
society,  in  which  subs  tan tial iy  ail  our  essential  human 
needs  will  ce  produced  without  a  great  amount  of  human 
effort,  the  fact  remains  that  the  milleniurr  is  net  here 
yet.  Human  beings  are  not  yet  obsolescent  as  important 
contributors  to  the  productive  process.-  The  quality  of 
performance  cy  men  and  women,  whether  in  manufacturing 
or  servicing,  still  accounts  for  a  very  large  part  in 
the  variation  in  productivity,  however  one  cheeses  to 
define  the  criteria  of  productivity  -  and  this  condition 
will  continue  to  prevail  far  Into  the  future.  (Per.  Z: 

?.  XII IJ 


Automatic  Speech  Recognition  (ASR)  is  a  demonstrated, 
on-line  technology  cf  clear  interest  to  the  military 
commander.  (Fcr  a  discussion  of  the  range  of  voice  input 
appl ica tions ,  the  reader  is  referred  to  References  [3J  and 
1 4] )  The  rapidity  with  which  current  data  processing  and 
computer  networks  now  operate  necessarily  Imply  that  the 
machinery  spends  an  inordinate  part  cf  its  time  waiting  for 
input  from  the  human  operator.  Speeding  up  this  aspect  of 
the  interface,  a  quantum  leap  in  user-accomcdation ,  is  one 
of  the  major  contributions  to  be  expected  (and  even  new 
available)  from  speech  recognizers  linking  the  human  to  the 
computer.  Speech  reccgnitcn  systems,  while  they  generally  do 
not  affect  the  internal  workings  of  the  computer,  present 


il 


their  greatest  potential  advantage  in  increasing  the 
efficiency  of  the  total  human/machlne  system.  Industry  vas 
quick  to  recognize  this  potential:  voice  input  systems  are 
presently  In  use  In  various  sorting  and  assemniy-l  Ine 
applications . 


Computer 

input  by  voice  presents 

several 

advantages  in 

the  military  context 

,  especially 

i  n 

the 

Command 

Post 

environment . 

It  frees 

operators ' 

hands 

to 

enable 

other 

concurrent  tasks,  such  as  transcription  or  manipulation  of 
other  equipment,  increasing  productivity  in  both  areas. [Ref. 
<i]  Just  as  importantly,  it  provides  the  potential  for  the 
voice-operator  to  maintain  a  graphic  or  figurative  plot  of  a 
situation  he  may  be  monitoring  at  the  terminal.  This  wc\:id 
allow  others  not  familiar  or  comfortable  with  the  computer 
to  comprehend  and  follow  what  the  operator  may  see  unfolding 
at  his  work  station,  without  requiring  lengthy  am 
distracting  explanations  from  the  operator.  Voice-input 
could  enable  the  operator  to  continue  his  task  at  the 
terminal  and  simultaneously  manipulate  a  visual 
representation  of  the  problem  he  is  involved  in,  for  others' 
benefit.  This  is  a  potential  boon  in  the  period  of 
transition  from  a  symbolic  gestalt  tc  an  era  cf  much  mere 
widespread  computer  literacy. 

moreover,  cognitive  psychology  has  presented  a 
scientific  model  for  task  suitability  to  different 
modalities  which  takes  the  criteria  fer  task  assignment  far 
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beyond  the  obvious  oasts  cf  ease".  Sandry  and  WicJcens  [Bef. 
t:  pp.8-10]  have  proposed  the  concept  of  "ideorotor 
compatabill ty" :  human  Input/output  rrodes  nay  be  rratched  to 
the  senses  and  hemispheres  cf  the  brain  which  mcst  readily 
process  them.  Quicker,  more  accurate  responses  result  when 
responses  are  matched  to  stimuli  (such  as  hearing  and 
speaking)  .  Short  term  memory  for  verbal  information  is 
significantly  Improved  wben  that  information  is  presented 
auditorily . 

There  are  obvious  correlates  of  ideorotor  oompatabili ty 
in  a  situation  such  as  a  Command  Test.  The  commander  gives 
an  operator  verbal  instructions  regarding  a  test:,  ar. a  this 
operator  then  executes  these  with  voice  input  to  a  terminal. 
In  competing  for  the  human  operator's  mental  resources,  it 
is  Sandry  and  Wlckens'  thesis  that  nc  tlme-sharln*  decrement 
results  when  competing  tasks  are  "ideomotor  compatible". 

The  automated  resources  available  to  the  commander  in 
the  current  environment  are  expanding  and  gaining 
increasingly  widespread  acceptance.  The  applicability  of 
voice  input  to  computer  systems  is  therefore  both 
intuitively  and  logically  attractive,  and  has  been  ealcing 

increasing  attention.  Stress,  however  defined,  is 

/ 

universally  conceded  tc  De  a  factor  in  the  human  side  cf 
this  equation,  and  therefore  to  has  implications  of  military 


interest  in  the  command  and  control  environment. 


Given  these  ccns lderaticns ,  what  then  might  re  the 
effect  of  stress  cn  an  operator  of  a  voice  input  system?  In 
various  areas,  research  cn  psychological  stress  abounds,  and 
researchers  are  now  turning  their  attentions  specifically  to 
the  effects  of  stress  cn  the  vcice.  The  published  research 
in  this  latter  area  is,  in  the  main,  inconclusive.  It  is  tne 
intent  of  this  thesis  to  see  if  seme  relationship  exists 
between  stress  incident  cn  a  voice-input  operator  and  his 
success  rate  with  an  automatic  speech  recognizer.  It  is  to 
ce  emphasized  that  virtually  no  work  has  been  published  in 
this  particular  area.  The  current  study  proposes  itself  as  a 
preliminary  inquiry  into  a  ^henoreocn  of  potential 
s ignificance ,  and  one  which  nas  as  yet  received  little 
experimental  attention. 
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II.  VOICE  RECOGNITION  AND  STRESS 


A.  DEFINITION 

Central  to  any  study  of  the  interaction  cf  stress  er.d 
the  output  of  voice  recognizers  is  a  definition  of  what  is 
to  te  understood  cy  "stress".  As  stated  (cf.  supra.',  the 
aisagreeme r.t  in  defining  stress  is  so  func.arrer.ta l  that  even 
recent  treatments  differ  in  classing  stress  as  stiri:lus  t 
response.  In  certain  areas,  now  ever,  corsersus  emerges. 

Stress  in  tne  pays  loicgic si  sense  is  rcre  ciear-cut  than 
the  psychological.  Physiologists  ray  refer  unanaiguovsiy  to 
nuroL  stressors  such  as  neat,  piessure,  electric  shocir,  ana 
the  iiice,  in  tne  vey  cf  stimuli.  The  resultant  reactions  are 
also  referred  to  as  stress,  anc  are  measured  in  cardio¬ 
pulmonary  pararre te rs ,  pupil  dilation,  ar.a  similar  observeo'le 
and  liuantif la oie  phenomena.  When  speahing  of  physiological 
stress,  it  suffices  to  specify  whether  stimulus  cr  response 
is  mean  t.  Kany  stressor-agents  ha v t  oeen  extensively 
studied  ooth  in  this  ccuatry  and  acroad  for  their  military 
impacts . 

Ihe  grey  area  is  entered  with  psy choiogical  stress  and 
its  causes  or  man if estatiers.  Various  researchers  define 
stress  as  the  organism's  reaction  (coping  or  adaptive)  tc  an 
unfriendly  environment.  Others  define  stress  as  the  inimical 
influence  of  the  environment  on  the  individual.  In  recent 
usage,  tne  former  definition  appears  to  have  gained 


ascendancy  over  the  latter,  although  the  debate  is  far  from 
settled. 

It  Is  therefore  Incumbent  tc  define  psychological 
"stress"  before  proceeding  further.  The  definiton  must  be 
sufficiently  droaa  to  span  a  wide  gamut  of  po  tentiai 
phenomena,  and  thus  preserve  meaingfulness .  In  addition,  it 
nust  be  sufficiently  narrow  to  be  useful  and  workable  within 
the  necessarily  limited  scope  of  this  experiment.  A 
synthetic  definition  of  stress  will  here  mean  "the  arousal 
produced  within  the  individual  cy  unwelcome  stimulus  cr 
threat".  This  is  tie  operational  definition  which  will  be 
cdoptea  in  an  atterpt  to  examine  tne  end-effect  of  producing 
this  type  of  arousal  in  subjects  interacting  with  an 
automatic  speecn  recognizer. 


There 

is 

eitens i ve 

literature  on 

the 

subject 

of 

psy  chc icgi 

Cd  i 

stress 

(much 

of  which 

fails 

wi tnin 

the 

definition 

accve)  and 

its 

correlates , 

especially 

the 

physiological  correlates.  The  i\avy  has  done  extensive  worn 
in  the  field,  most  notably  in  Par  Diego.  [Ref.  6] 
Psychological  stress  may  be  further  oronen  down  into 
interesting  subcategories.  1)  Situational  stress  is 
generally  held  tc  be  the  influence  cn  the  individual  of 
unfavorable  environmental  lout  non-physical)  factors.  These 
factors  are  beyond  the  individual's  control  and  are 
eenerally  thought  to  consist  of  a  certain  set  of 
circumstances  such  as  public  speaking,  deadlines,  ana  the 

it 


iiKe.  c)  Seir-induced  stress  is  generally  recognized  as  the 
self-imposi tion  ot'  a  condition  or  stimulus  leading  to  tne 
type  or  arousal  denned  above.  Ixarrpies  include  goals, 
deadlines,  or  performance  requirements  of  any  type  with 
which  the  individual  forces  himself  to  tunction  above  a 
"ccrnr crtacie"  or  "easy"  level. 

These  distinctions  often  merge,  however.  In  a  prime 
example,  Swedish  military  pilots  were  given  a  relatively 
complex  set  of  procedures  to  execute  in  an  unsophisticated 
riignt  simulator,  ana  promised  a  reward  for  high 
performance.  [Ref .  7]  The  situation  was  imposed  on  them, 
and  could  oe  construed  as  situational  stress.  The  desire  to 
perform  at  a  high  level,  however,  was  self-induced.  An 
interesting  outcome  of  the  experiment  was  that  the 
pnysicicgicai  indicators  of  the  pilots  in  question  reacted 
remarxaDiy  liKe  those  of  pilots  in  a  similar  non-simula ted 
situation.  A  further  interpretation  of  tnese  studies  was 
that  subjects  had  come  to  accept  the  reality  of  the 
simulated  situations,  ana  tneir  physiological  indicators 
retlected  as  much.  Such  studies,  in  general  ■  tend  to  support 
the  validity  of  simulated  situations  in  conducting  stress 
research. 

B.  STRESS  AM  THE  VOICE 

Lespite  the  abundance  of  general  stress  literature, 
relatively  little  has  been  done  on  how  this  phenomenon  might 
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manifest  itself  in  the  buman  voice.  Micks  [Ref.  £]  gives  a 
thoughtful  overview  of  the  recent  literature,  in  addition  to 
bis  own  experimental  results  with  various  forms  of  stress 
ana  voice.  Re  concedes  the  generally  accepted  notion  that 
stress  has  an  influence  on  individuals'  fundamental  speaking 
frequency  (io)>  be  concludes  (as  do  others)  that  this 
parameter  is  as  likely  to  go  up  as  down,  with  concomitant 
frequency-range  variations.  The  magnitude  aDd  direction  of 
this  sain  in  5o  is  a  function  oi  me  individual  speaker. 
Additionally ,  tne  amount  of  snift  required  tc  constitute  an 
indication  of  stress  is  stiii  a  sutjeciive  assessment.  The 
cne  conclusive  result,  reported  by  Licks  and  otners  [Ref.  6: 
p.9y ,  p.lii4j  is  that  the  ratio  of  speaking  time  to  total 
time  will  increase  in  a  stressed  speaker.  This  is  consistent 
with  everyday  experience:  "stressed"  speakers  often  appear 
to  taiK  in  longer  bursts,  with  shorter  pauses  separating 
them.  Parameters  other  than  the  speech-to-pause  ratio  give 
inconclusive  or  contested  results. 

A  large  amount  of  commercial  interest  has  focused  on  the 
influence  of  stress  on  vocal  parameters.  One  can  imagine  the 
potential  interest  in  being  able  to  evaluate  the  stress 
level  of  a  speaker.  If  done  by  voice,  this  analysis  could  be 
performed  at  great  distances  (via  telephone,  radio,  etc.) 
ana  without  the  speaker's  knowledge.  Several  firms  have 
marketed  devices  purported  to  analyze  the  stress  level  of 
speakers  (as  a  sort  of  voice  polygraph),  but  their  claims 


appear  subject  to  debate  ana  are  hotly  contested  In  academic 
circles.  These  devices  base  their  analysis  cn  the  existence 
of  microtremor  in  the  muscles  controlling  vocal  activity,  a 
phenomenon  which  rerains  to  oe  shown  conclusively.  [Ref.  9: 
p.l?£] 

isorever,  the  readout  provided  ty  voice  stress  analyzers 
requires  rather  subjective  interpretation.  As  noted,  vocal 
variations  so  far  demonstrated  have  shown  a  sub ject-speciric 
nature:  where  one  subject's  ic  rises,  another's  may  fall, 
etc.  Thus,  analysis  of  a  speech  sarjie  for  stress  requires 
some  apricri  Knowledge  of  the  individual  speaKer's 
tendencies.  Analytic  efforts  so  far  have  teen  limited  by  the 
necessity  for  rather  subjective  analysis.  Work  is  currently 
underway  coth  to  objectify  such  analysis  and  improve  the 
analytical  algorithms  involved.1’ 

In  sucrt,  there  currently  no  universal  ana  few 
conclusive  indicators  of  stress  in  a  speaker's  voice.  It  is 
widely  accepted  that  psychological  stress  has  an  influence 
on  vocal  production,  yet  hotly  contested  In  academic  and 
industrial  circles  what  the  best  Indication  cf  such  stress 
is.  In  a  military/ applications  context,  it  may  well  be  that 
tne  most  fruitful  area  of  inquiry  would  be  the  implications 
cf  stress  upon  the  voice,  from  the  viewpoint  of  productivity 
and  reliability  with  autonatic  speech  recogoizers. 

^Conversations  with  Dr.  I.  Biers,  University  of  Uayton, 
October  1962. 


III.  EXPER IMENTAL  COMPONENTS 


A.  GENERAL 

It  is  generally  recogni2ea  to  be  important  to  simulate 
an  an  experiment  tne  conditions  under  which  the  phenomena  or 
Interest  might  actually  he  expected  to  occur.  This  is  nc 
less  important  in  experiments  dealing  with  voice  input, 
where  there  is  a  clearly  definable  task  and  one  suspects  an 
operative  dui  unquantifiabie  factor  such  as  stress. 
Moreover,  with  voice  input,  it  is  essential  to  use  subjects 
representative  of  those  who  m.ight  actually  be  using  ASR  in 
the  task-environment  of  interest. 

c «  SUBJECTS 

Twen  ty-l'cur  volunteer  subjects  were  obtained,  maim/ 
t'rcm  the  C2  curriculum  at  the  Naval  Postgraduate  school.  The 
m.akeup  of  the  sample  is  shown  beiow: 

Ta  o le  1 

SUBJECT  DEMOGRAPHICS 


EXPERIENCED 

INEXPERIENCED 

MALE  MIL  11  ARY  1 

y 

i  e  ! 

SIMALi  MILITARY  | 

r< 

V. 

1  1  1 

l  1  1 

MALE  CIVILIAN  ! 

0 

<  1  1 

1  l  1 

i  EM  ALE  CIVILIAN  1 

t  ‘J  1 

1  <-  1 

TOTAL: 

IP 

12 

Twenty-one  ci  these  subjects  were  military  officers,  m  the 
grade  c f  03  and  04,  representing  the  four  services?  three 
were  frotr  civilian  agencies.  Six  ci  the  twenty-four  were 


female.  These  subjects  were  at  a  level  in  their  careers 
where,  especially  in  light  or  theii  C3  background,  they 
might  reasonably  expect  to  oe  at  the  mid-  tc  upper  levels  cf 
a  Command- Pc  si  staff.  In  short,  they  were  prime  candidates 
for  using  voice  input  in  a  Ccmmand-Fcst  scenario. 

Cf  tne  twenty-four,  half  haa  sore  prior  experience  witn 
Ad R ,  which  was  defired  for  the  purposes  of  the  experiment  as 
participation  in  three  or  more  experiments  at  the 
Postgraduate  School?  the  ethers  nad  none.  The  sexes  were 
evenly  aiviaei  oetween  the  two  groups.  Batchellor  [Ref.  10] 
found  nc  statistically  significant  difference  in  recognition 
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C.  VOCABULARY 

lor  this  experiment,  subjects  trained  a  fifty  word 
vocaouiary  (Appendix  A)  which  included  some  numerals  and 
various  ether  utterances.  The  vocabulary  was  chosen  with  a 
specific  purpose  in  mind.  It  has  been  shown  lBef.10]  that 


correct  recognition  cf  a  word  or  phrase  is  directly  related 
to  the  length  of  the  utterance.  In  this  light,  sore  words 
were  deliberately  chosen  over  others  tc  avoid  confusion  or 
possioie  misreccgn it  ion .  (lor  example  "send  message"  in  lieu 
of  the  shorter  send.)  Other  words  were  emitted  for  the 
same  reasons.  Some  of  the  vocaouiary  items  were  used 
infrequently ,  tut  were  necessary  for  the  task.  In  sum,  the 
vocaouiary  chosen  was  the  minimum  felt  necessary  to  enable 
composing  snort  messages  with  a  modicum  of  realism  and  some 
variety . 

L.  TASK 

lacn  subject  was  given  a  scenario  consisting  of  2W 
utterances  (Appendix  E),  with  each  word  or  phrase  numbered. 
These  utterances,  when  correctly  interpreted  oy  the 
recognizer,  brought  up  on  the  screen  an  output  resemDling 
messages  which  might  be  sent  or  serviced  by  a  sbipcoara 
Command  Fcst.  (The  ouput  from  these  message  sequences  is 
shown  in  the  right-hand  column  of  Appendix  B,  which  was  not 
furnished  to  experiment  participants;  for  examples  cf  the 
types  of  messages  see  Figure  1.)  Certain  utterances  produced 
a  mui tipie-word  output,  simulating  message-handling 
utilities  such  as  are  resident  on  the  ARPANET ,  with  which 
ail  subjects  were  familiar.  Verisimilitude  was  an  important 
consideration  in  the  design  of  tne  task: 
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SAMPLE  MESSAGE  OUTPUT 


SENE  MSG  TO:/  CY  BATTLF  GROUP  /  COMMANDER 
INFO  COPY  TO:  /  COMTHIKL' FLEET  /  CINCPACFLEET 
FROM:  /  USS  RATHBURNE 
POSITION  /  REPORT  /  NUMBER  /  1 
REAEINESS  LVL  /  2 

POSSIBLE  /  ELECTRONIC  /  SURFACE  /  CONTACT 
EEARING :  /  1/8/0  /  DISTANCE  /  2/7 
CONFIDENCE  LVL/  3 
REQUEST  /  INSTRUCTIONS 

QUIT./ 


FORWARD  MSG  #:  /  7 

INFO  COPY  TO:  /  CV  BATTLE  GROUP  /  COMMANDER 
FRCM :  /  USS  RATHBURNE 

QUIT./ 


(NB:  The  symbol  /  indicates  the  separation 
between  utterances.  Line  separation  is 
accomplished  by  speaking  "RETURN"  which 
yields  a  carriage  return.) 


Figure  1 

SAMPLE  MESSAGE  OUTPUT 
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The  point  of  departure  of  the  synthetic-work  approach  is 
a  behavioral  analysis  cf  the  performance  requirements 
placed  on  the  operator  oy  some  specific  system,  or  by  a 
class  of  such  systems  in  general.  Tasks  are  then 
selected  against  a  criterion  cf  ccrtent  validity  (i.e., 
tasks  are  selected  oecause  they  measure  functions  judged 
cy  experts  iu  the  field  to  be  important  to  the 
operational  situation  cf  interest).  A  general  criterion 
of  face  validity  is  also  imposed  (i.e.,  tne  tasks  are 
configured  tc  be  acceptable  to  target  populations,  e.g., 
pilots  ...  [3ef.ll:  pp.  22-22j 


Subjects  were  to  speak  the  utterances  in  order,  each  one 
until  the  proper  output  appeared  cn  their  screen.  Each 
subject  was  given  the  same  scenario  three  times  in  the 
course  cf  tne  experiment,  with  at  least  one  day  (cut  r.c  mere 
than  two)  between  reprises.  Cn  subsequent  trials,  subjects 
were  tcid  that  the  scenario  they  were  given  was  cf  exactl/ 
the  same  length  and  of  similar  content  to  the  one(s)  they 
hac  seen  previously.  The  same  scenario  was  used  repeatedly 
to  control  for  variability  oi  the  task,  ana  it  was  feit  that 
the  scenario  was  of  sufficient  length  and  variation  to 
negate  any  significant  learning  effect. 

E.  TIMS  STRESS  FACTOR 

During  the  first  phase  of  the  experiment  (Phase  l), 
subjects  were  allowed  to  complete  the  scenario  at  their  own 
pace,  ana  were  exhorted  not  to  rush  or  needlessly  linger, 
cf.  Appendix  C).  The  experimenter  attempted  tc  put  them  at 
ease.  Unknown  tc  the  subjects,  they  were  being  timed,  ana 
their  times  tc  complete  the  scenario  recorded.  This  time  was 


to  serve  as  an  individual  baseline  against  which  subsequent 
phases  would  be  timed.  In  the  next  phases  (Phases  2  and  2) 
subjects  were  allotted  two-thirds  anc  then  one-third  cf 
their  baseline  times,  respectively,  tc  attempt  tc  complete 
the  scenario.  It  was  anticipated  that  not  all  subjects 
would  complete  me  scenario  under  the  shortened  tire 
allowance:  scoring  was  adjusted  to  reflect  the  amount  cf  the 
task  actually  completed,  (cf.  IV. E  infra) 

The  time-corpr ess icr  was  intended  to  induce  stress  In 
tne  sucjects.  Iney  were  intormea  at  the  outset  of  how  much 
time  they  wcuia  have  tc  complete  a  tasn  with  which  they  were 
already  familiar.  A  large  Gre-Lac  aarKroor  timer  was  plainly 
visicie  in  front  cf  the  subjects  tc  shew  time  remaining. 
Additionally,  the  experimenter  reminded  the  subjects 
insistently  at  t  hir  ty-^ccna  intervals  of  their  time 
remaining.  Subjects  could  guage  their  own  progress  ir.  the 
c<jd  item  scenario.  Apriori,  it  was  felt  that  the  two-thiras 
time  would  constrain  some  out  doc  axl  subjects,’  it  was 
reasonably  certain  that  allowing  cnly  one-third  of  the 
caseime  time  would  ao  so.  This  opinion  seemed  to  pe  borne 
out  in  pilot  trials  (with  subjects  net  used  in  the  later 
experiment ) . 

i.  ECUIFPENT  ANE  SETUP 

The  voice  recognition  system  employed  was  the  T 
recognizer  manufactured  by  Threshold  Technology,  Inc.  (See 


iigure  2  for  a  block  diagram  of  the  system.)  Ibis  is  a 
discrete  utterance  analyzer,  which  analyzes  samples  cl' 
speech  up  to  a  raximum  ot  two  seconds  in  length,  separated 
by  a  distinct  pause  cf  at  least  ODe-tenth  of  a  second. 
Utterances  cf  mere  than  cue  word  are  allowed,  as  long  as  tne 
words  are  run  together  and  pronounced  "naturally",  without 
distinct  creaks  between  them. 

The  ItZ*  has  a  maximum  capacity  cf  256  utterances.  In 
the  training  mode,  the  subject  repeats  a  werd  or  pnrase  ten 
times.  (Under  certain  circumstances,  fewer  repetitions  nay 
oe  usea,  but  this  requires  moaifi cation  of  the  software  and 
yields  less  consistent  results  [Ref.  10j )  .  After  the  tenth 
repetition,  the  ten  are  superimposed  and  normalized  in 
length  to  ferm  a  template  for  that  utterance.  Turing  this 
experiment,  a  training  standardization  measure  was  adopted, 
knen  the  subjects  had  finished  training,  they  repeated  the 
vocabulary  list  in  toto  at  least  twice,*  any  items  not 
correctly  recognized  on  at  least  twe  cf  three  passes  were 
retrained  . 

When  the  user  speaks,  his  speech  sample  is  compared 
against  aii  these  templates  for  the  cest  match.  If  within 
preset  tolerances,  the  utterance  is  recognized  (correctly  or 
not)  and  the  output  string  sent  to  the  display  screen.  If  no 
natch  is  louna,  tne  machine  produces  an  audible  "beep".  Each 
individual  may  store  his  voice  template  on  a  data-tape 
cartridge  for  future  use,  and  thus  bypass  the  lengthy 
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training  process  in  future  sessions.  Each  utterance  is 
programmed  by  the  user  (in  this  case,  the  eiperlmenter )  with 
a  prompt  for  training.  When  that  utterance  is  cued  at  the 
Keyboard,  the  prompt-string  appears  on  the  display!  it 
disappears  after  the  tenth  repetition.  After  training,  the 
recognizer  shifts  to  the  reccgniton  ncae  ana  awaits  vernal 
input.  When  utterances  are  recognized,  an  associated 
pr eprcgrammed  output  string  is  sent  to  the  display  screen, 
'i'ae  output  string  corresponding  to  an  utterance  is  user- 
selected,  allowing  up  tc  a  maximum  cf  sixteen  alphanumeric 
cha rec  te rs  . 

wneu  used  in  conjunction  with  a  computer,  the  T€0Z  may 
be  used  in  either  direct  or  buffered  modes.  "Direct"  sends 
tae  output  string  to  the  computer  without  delay.  The 
"Buffered"  mode  allows  the  user  tc  verify  the  output  before 
allowing  it  to  te  sent.  The  direct  mode  was  usee  here.  In 
this  experiment,  the  760V  was  used  in  a  stand-alone 
capacity:  output  strings  were  sent  tc  the  display  screen  and 
nowhere  else,  (for  further  discussion  see  [Hef.  12]  ) 

Tne  microphone  used  was  a  Shure  SM0  noise  cancelling 
unit.  It  was  worn  oy  all  subjects  on  the  left  side  cf  the 
head,  directly  below  and  slightly  to  the  left  of  center  of 
the  lips.  Tc  further  attenuate  echo  and  outside  noise,  the 
experiment  was  conducted  inside  an  Industrial  Acoustics 
Company  soundproof  chamber.  (See  figure  3  for  the  actual 
equipmea t  setup )  . 
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•Figure  3 
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IV.  m'ERIMMilL  EESIGN 


A.  OBJECTIVE 

The  primary  objective  was  to  see  if  tijre  stress  affected 
users'  recognition  rates  with  this  voice  reccgnlticn  setup. 
After  the  initial  training  session,  (and  after  a  delay  of 
three  to  four  days)  subjects  began  the  pcrticn  cf  the 
experiment  which  involved  the  scenario.  They  were  fitted 
with  tne  microphone  and  haucec  the  scenario.  For  each 
respective  pnase,  they  were  read  the  instructions  in 
Appendices  C,D,  and  E.  The  intention  was  to  induce  stress  in 
suojects  in  the  iast  two  peases  cf  the  experiment  cy 
progressively  shortening  the  tine  allowed  to  penorir.  a  task 
cf  fixed  length.  C empress icc  cf  time  allowed  was  tc  be 
correlated  with  percent  correct  recognition  rate. 

It  was  neped  tnat  several  factors  were  to  be  at  wcrK  cn 
the  participants.  There  was  a  definite  competition  for  the 
subjects'  mental  resources.  Subjects  haa  to  read  a  word  or 
phrase  1  rem  their  printed  scenarios ,  speas  it ,  and  verify 
tnat  it  appeared  correctly  on  tneir  screen.  If  misrecognl zed 
cr  "oeepec",  they  were  to  try  again  until  correctly 
recognized.  The  experimenter  instructed  subjects  to  proceed 
to  the  neit  scenario  item  if  they  had  five  unsuccessful 
attempts,  although  they  haa  no  foreknowledge  of  this.  This 
was  yet  another  factor  in  the  stress  equation:  failure  at  a 


Z\d 


tasK  is  in  itself  stressful  for  frost  individuals .  [Eef.  13: 
p . 151] 

At  the  cutset  cf  each  of  the  last  two  ( tirre-constrained ) 

phases  of  the  experiment,  subjects  were  informed  of  how  much 

time  they  would  oe  allotted  to  perform  the  scenario.  After 

the  second  and  before  the  last  phase,  many  indicated  to  the 

experimenter  that  they  " ew "  they  would  have  less  time  than 

in  the  previous  session.  This  suspicion  was  desired  and  was 

calculated  to  create  a  measure  of  negative  anticipation  in 

the  participants.  During  the  last  two  phases  cf  the 

experiment,  subjects  were  reminced  insistently  by  the 

experimenter  cf  time  remaining,  in  addition  to  the  presence 

cf  the  nigniy  visible  timer.  It  was  estimated  that  this 

ccmclr.aticn  of  factors  would  induce  stress: 

Tae  apprehension  of  psychological  stressors  is  a  much 
rore  complex  business.  It  depends  .on:  (l)  prior 
experience  with  a  ciass  of  stresscrs;  (2)  remembering 
tne  experience;  (3}  recognizing  that  the  present 
stimulus  gestalt  is  au  instance  cf  the  earlier  class  of 

stresscrs;  and  .4)  believing  that  the  iifceiihocd  of  the 

threatened  occurrence  is  above  some  subjective 

threshhoid.  (Ref.  13:  p.  142] 


moreover. 

While  the  experimental  manipulation  allows  a  fairly  high 
degree  cf  control  over  the  stimulus  conditions,  the 
reaction  patterns  of  subjects  are  often  difficult  to 
determine.  ...  Thus,  the  stress  level  induced  may  be 
minimal  for  some  subjects,  whereas  it  may  be  moderate  or 
rattier  strong  for  others.  Since  it  is  often  difficult  to 
determine  a  definite  external  criterion  for  stress  (even 
psychophys i olcgicai  variables  cannct  always  be  expected 
to  indicate  the  stress  level  of  an  individual 
consistently  ana  validly,}  experimenters  using 
laboratory  stress  induction  can  oDiy  hope  that  they  have 
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been  able  to  induce  a  reasonable  amount  of  stress  In 
tnelr  subjects.  IRef.  9:  p.  i?ej 

The  fundamental  hypothesis  being  tested  here  may  be 
summed  up  as: 

He:  Tire  stress  has  no  effect  on  the  recognition 
rates  of  users  of  voice  recognition  systems. 

Hi:  Tirre  stress  affects  the  recognition  rates  of 
users  of  voice  recognition  systems. 

Additionally,  it  vas  hoped  to  see  whether: 

He:  Experienced  and  inexperienced  users  of  voice  recognition 
systems  are  affected  equally  by  time  stress. 

HI:  Experienced  and  ineiper ienced  users  of  voice  recognition 
systems  are  not  affected  equally  by  time  stress. 

B.  SCORING 

Subjects  were  monitored  during  their  performance  of  the 
scenario:  the  experimenter  sat  to  the  right  of  and  behind 
subjects  (cf.  figure  3)  from  which  position  both  the 
subjects'  actions  and  the  display  screen  were  clearly 
visible,  misreccgnit Ions  (improper  word  substitutions  by  the 
recognizer)  ard  non-recognitions  were  recorded.  A  confusion 
matrix  cf  substitutions  ar.d  "beeps"  is  included  as  Appendix 


i .  Subjects  occasionally  lost  their  place  in  the  scenario, 
repeating  some  vocabulary  items  they  had  already  spoken. 
These  repetitions  were  counted  as  additional  utterances,  and 
the  correct  or  incorrect  result  recorded?  thus,  no  penalty 
was  given  for  repetition.  The  caveat  in  the  instructions 
about  penalties  for  non-completion  of  the  scenario  was 
ccgus.  It  was  added  to  provide  an  additional  incentive  to 
finish,  and  in  tne  hope  of  reinforcing  the  subjects' 
apprehension.  (Subjects  completed  an  average  of  fifty-three 
percent  of  tne  scenario  in  Fhase  3?  none  finished.) 
Recognition  rates  were  cased  on  the  items  cf  the  scenario 
actually  completed.  Percentage  correct  recognition  was 
figured  as  tne  quotient  of  the  number  of  correct 
recognitions  divided  by  the  total  number  of  utterances 
spoaen  cy  the  subjects  (including  mis-  ana  non- 
recognitions  )  . 

At  the  end  of  each  phase  of  the  experiment,  subjects 
were  asked  to  rate  their  feelings  during  that  phase.  (See 
questionnaire.  Appendix  G).  live  surrogate  descriptors  were 
used  to  circumscxibe  tne  idea  of  "stressed"?  subjects  were 
asKed  if  they  f e It  : 

1.  nervous 

2.  confused 

2.  pressured 
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4.  irritated 


l.  anxious 

on  a  scaie  of  one  to  five,  five  being  the  highest  ievel  cf 
each.  It  was  thought  tnat  these  individual  response  levels 
[right  somehow  te  related  to  recognition  rates. 

G.  GONCIPHAL  EES IGN  OF  THE  EXPERIMENT 

The  design  employed  was  a  two-factor  rrixed  design  with 
repeated  freasures  on  one  factor.  [Ref.  14:  pp.  34-61] 
Alternatively  (and  for  purposes  or  clarity)  the  design  might 
be  thought  of  as  involving  two  conditions 
( experienced/ inexperienced )  and  three  treatments.  (See 
Figure  4  for  a  graphic  interpretation)  . 

As  stated  previously,  the  treatments  consisted  of  three 
variations  of  time  in  the  testing,  constituting  the  three 
phases  (eicludicg  the  training  phase). 


1.  Phase 

1 

baseline  tire:  no  constraint 

2.  Phase 

2 

2/3  baseline  time 

3.  Fhase 

3 

1/3  baseline  time 

These  phases  were  conducted  in  the  order  shown.  While  it 
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Figure  4 

CONCEPTUAL  DESIGN  OF  THE  EXPERIMENT 
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might  tie  thought  preferable  in  the  classical  design  of 
experiments  to  randomize  the  administration  of  the  time 
constrained  portions,  this  was  not  done.  Randomization 
might  have  controlled  for  the  possible  effects  of  "learning" 
between  Fhases  2  and  3;  this  benefit  was  consciously 
sacrificed  in  order  to  introduce  the  apprehension  factor 
discussed  above.  It  was  suspected  apriori  that  the  drastic 
time  compression  from  the  second  to  the  third  phase  of  the 
experiment  would  help  to  compensate  for  the  possible 
confounding  introduced  by  "learning".  Moreover,  the  scenario 
was  feit  to  oe  long  and  varied  enough  to  counteract  learning 
in  some  measure,  in  addition  to  the  exper inen ter 's  deception 
that  successive  scenarios  were  net  exactly  the  same.  The 
fact  that  ail  phases  were  separated  by  no  less  than  a  day 


V.  RESULTS  AND  ANALYSIS 

A.  ANALYSIS 

The  results  of  tne  experiment  as  described  are  shown 
graphically  in  Figure  t.  Before  conducting  any  analysis, 
Hartley's  test  for  equality  of  variance  was  performed  on  the 
raw  data.  The  rest  its  indicated  that  equality  cf  variance 
was  an  invalid  assumption,  and  thus  the  raw  data  required 
transionratio.::  after  arcsin  transformation,  the  equality  of 
variances  was  accepted.  (See  figure  6) 

The  analysis  of  variance  table  is  presented  in  Figure  ?. 
As  is  readily  apparent  from  the  graph  cf  the  raw  data, 
recognition  rates  for  the  experienced  group  decrease 
monotcaicaiiy  with  time  compression.  This  is  also  true  for 
Phases  k  ana  3  of  the  inexperienced  eroup.  The  ?  statistic 
shews  a  significant  difference  between  trials.  Therefore, 
the  null  nypothesis  of  time  stress  not  affecting  recognition 
rate  is  rejected. 

An  i:  eresting  phenomenon  is  observed  with  the 
inexperienced  group,  however.  From  Fbese  1  to  Phase  c  their 
success  rate  increases  ncticeaDiy,  despite  the  decreased 
time  allotted  for  the  tasjr.  This  is  hypothesized  to  be  the 
result  of  a  combination  cf  factors,  the  most  obvious  of 
which  is  learning.  From  their  first,  open-ended  trial  to 
the  second,  the  inexperienced  group  became  more  adept  at  the 
tasic.  hereever,  given  their  relative  naivete  with  the 
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SUMMARY  OF  EXPERIMENTAL  RESULTS 


RAW  DATA 

{For  percentages,  irultiply  by  100) 


X  St 


Fhase 
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EXPERIENCED 

Phase 

C 

.947 

.0012 

Phase 

3 
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Figure  5 

LATA  SUGARY  AND  HARTLEY'S  TEST 


SOURCE 

SS 

df 

rs 

i 

To  tal 

4.292 

71 

Eetv»een 

subjects 

2.074 

23 

Experience 

.  8fc42 

1 

.0642 

.702 

Errcr  ( t ) 

2.041 

22 

.0914 

in  ithir 
subjects 

2.216 

48 

Tire  Factor 

.2242 

2 

.1621 

4.27 

Tir eiEzperience 

.2216 

f 

.nee 

2.92 

Error  ( v* } 

1.672 

44 

.036 

figure  7 


ANALYSIS  Of  VARIANCE  TABLE 
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equipment  setup  ana  the  task,  inexperienced  subjects  tooir 
longer  to  complete  the  Initial  phase  of  the  experiment  and 
thus  had  a  longer  baseline  {an  average  of  7.8i»  minutes, 
versus  6.48  for  the  experienced).  In  the  linear  compression 
cf  Phases  2  and  3,  they  simply  had  more  time  than  the 
experienced  6rcup. 

It  was  not  known  apriori  if  allowing  two-thirds  of 
baseline  time  In  the  second  phase  wcuia  Impose  a  felt  ti^e- 
constraint  on  ail  subjects.  For  this  reason,  one-third  was 
chosen  fcr  the  final  phase:  although  seemingly  impossible, 
it  was  felt  that  such  a  drastic  time  reduction  would  induce 
stress  in  the  majority  cf  the  subjects.  Such  appears  tr  ce 
the  case.  Experienced  subjects  snow  a  decline  in  each 
successive  puase  cf  the  experiment.  Inexperienced  subjects, 
although  theoretically  stiii  on  the  u;wara  side  of  their 
"learning  curve"  shew  a  pronounced  decline  from  Pnase  2  tc 
Phase  Z . 

For  the  inexperienced  group,  both  the  rapidity  of  the 
learning  effect  ar,d  the  quickness  with  which  it  was  negated 
have  additional  significance.  A  Newmao-Xeuls  range  test 
.alpha  =  .2b)  was  performed  for  differences  in  the  means 
amongst  phases  of  the  experiment.  IRef.  16:  pp.  35-37] 
Phases  1  ana  2  were  seen  to  oe  signifies  . ly  different  from 
Phase  3,  cut  were  not  differentiable  between  themselves: 
this  is  obviously  aue  to  the  inexperienced  group  starting  at 
a  lower  rate,  then  coming  up  to  meet  the  experienced  group's 
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mean  score.  A  two-sample  "t"-test  shows  a  difference  between 
the  experienced  ar.a  inexperienced  groups  only  in  the  first 
phase.  (At  an  alpha  critical  of  .£69).  In  subsequent  phases 
of  the  experiment,  the  two  groups  are  statistically 
indistinguishable.  This  is  further  confirmed  by  the  F 
statistic  for  experience  in  the  analysis  of  variance.  The 
inference  to  be  drawn  from,  this  appears  to  be  that  the 
effects  of  teaming  cn  this  type  of  equipment  are  so  rapid 
as  to  m’cKe  the  two  groups  quicuiy  very  similar. 

H.  CCRBILAUCN 

rirst,  an  attempt  was  made  to  correlate  suDjects' 
questionnaire  responses  with  tne  phase  or  the  experiment  tc 
whicn  they  were  responding.  It  was  desired  to  see  if 
response  levels  were  somehow  dependent  on  the  time  stress 
levers  imposed  or.  the  participants.  If  sc,  a  positive 
correlation  would  show  a  rising  response  level  to  one  or 
mere  of  tne  five  questions  as  time  compression  increased 
witn  experiment  phase.  The  data  to  be  correlated  were 
nominal  (phase)  versus  ordinal  (1  to  5  cn  questionnaires)  so 
tne  ncnparametric  Spearman's  Rho  was  used.  [Ref.  17:  pp. 
2S2-2h6j  Table  2  shows  the  results: 
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Table  2 


Correlation  of  Questions  with  Phase 


QUESTION  KHO 


1.  nervous  .234  * 

2.  confused  .01? 

3.  pressurea  .532  * 

4.  irritated  .020 

5.  anxious  .230  * 


(*  indicates  significant  at  alpha  less  than  .05) 

Thus,  three  of  the  five  4uestions  shew  a  strong  correlation 
*itu  the  tine  compression  with  phase.  Ncreover,  these  three 
particular  descriptors  (nervous,  pressured,  ana  amicus) 
were  thought  to  De  the  closest  surrogates  of  the  five  tc 
otherwise  describe  the  situational  or  tire  stress  desired. 
(Toe  other  two  show  no  cerreia ti cn . )  It  appears  that  the 
experimental  conditions  were  successful  in  inducing  stress 
in  sons  treasure.  Mean  response  rates  for  subjects'  answers 
are  shewn  in  figures  5-12. 

Next,  an  attempt  was  made  to  correlate  subjects' 
quest ienna ire  responses  (cn  the  five  descriptors  already 
nentioaea)  to  their  recognition  rates  with  the  system.  It 
was  suspected  teforehaua  that  response  levels  tc  one  or  more 
of  these  questions  night  increase  nonotonicaily  from  one 
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Phase  l 


Phase  2 


figure  12 


Phase 


FEAN  P.ESPCNS2  LEVELS  TC  wUESTICN  5,  "Af.XIOUS 


phase  to  the  next  auring  the  conduct  cf  the  exper imen t .  It 
is  readily  apparent  that  this  was  not  the  case.  While  the 
five  surrogate  descriptors  were  deemed  capacie,  in  the 
aggregate,  tc  descriDe  stress,  they  apparently  were  not 
completely  successful  as  a  suejective  treasure. 

Tne  riss  cf  seif-inhibition  is  always  run  when  asfclr.g 
subjects  tc  rate  their  own  perceptions,  etrotions,  etc.  [Eef. 
18:  pp.  iUl-teZ]  Self-image  or  self-esteem  tray  prohibit  a 
respondent  frcrr  admitting  of  a  higher  level  cf  what  he  may 
feel  tc  ce  an  "undesirable"  feeling  cr  trait.  This  may  have 
been  the  case  here.  In  the  course  of  the  experiment,  the 
experimenter's  subjective  assessment  cf  the  subjects' 
emotional  states  did  not  agree  well  with  their  questionnaire 
responses.  Cn  several  occasions,  subjects  left  the 
experiment  giving  every  appearance  of  being  agitated  and 
exercised,  having  answered  with  a  "l"  (i.e.  not  at  ail)  to 
all  five  questions  . 

Moreover,  although  instructed  to  respond  only  In  regard 
to  each  individual  phase  of  the  experiment,  it  appears 
lively  that  subjects'  responses  may  have  been  in  the  context 
cf  their  previous  experiences  in  the  experiment.  Thus  one 
sees  the  rise  in  most  responses  from  the  first  (untimed)  to 
the  second  phase  of  the  experiment.  Curing  the  final  phase, 
subjects  were  expecting  to  be  placed  under  a  tighter  time 
constraint  (the  desired  appr ehensi cd  effect).  This  may  in 
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part  explain  the  absence  of  a  rise  in  response  levels  for 
this  last  phase. 

The  data  gathered  from  the  questionnaires  was  in  the 
ordinal  (1  to  5)  scale.  Since  an  attempt  was  to  be  made  to 
correlate  these  responses  with  recognition  rates  (interval 
data)  recourse  was  again  made  tc  the  non-parametri c 
Spearman's  Rho  procedure  for  correlating  the  ranks  of  both 
sets  of  data.  Recognition  rates  were  tested  against 
responses  to  all  five  questions  for  each  phase  of  the 
experiment.  Results  generally  showed  no  correlation. 

One  question  alone  provided  a  rather  consistent  negative 
correlation  with  recognition  rates.  For  the  inexperleucec 
suojects,  question  Z  ("confused")  showed  a  strong  negative 
correlation  (rho<  -e.4)  for  ail  three  phases  of  the 
experiment  (credible  at  an  alpha  less  than  cr  equal  to  0.1). 
This  would  imply  that  as  confusion  level  rose,  recognition 
rate  went  down.  It  is  Interesting  tc  ncte  that  there  was  nc 
correlation  found  between  this  variable  and  recognition  for 
the  experienced  group.  The  only  other  question  which  showed 
sore  (negative)  correlation  was  the  fourth  ("irritated"), 
but  this  relation  was  at  most  tenuous  ( e .25>a lpha>e . 1 ) .  Aiso 
of  note  is  the  fact  that,  after  the  initial  pnase  of  the 
experiment,  response  levels  for  the  two  groups  parallel  one 
another,  in  some  Instances  very  cicseiy.  This  would  appear 
further  to  corroborate  the  hypothesis  that  differences 
cetween  the  two  groups  diminish  rapidly. 


VI.  REMARKS 


If  psychological  stress  has  an  effect  on  the  voice,  as 
is  generally  acknowledged,  it  is  reasonable  to  expect  it  to 
have  an  effect  on  the  success  rates  cf  users  of  voice  input 
equipment.  As  discussed  previously,  ASR  technology  is  of 
potentially  great  interest  to  the  rrilitary  planner  in  the 
Command,  Control,  ana  Communications  context.  This 
preliminary  inquiry  has  attemptea  tc  shew  a  ccrrelaticn 
between  stress  ana  reccanition  success  with  such  equiprent. 
Several  areas  cf  ccnsiaeration  suggest  themselves. 

As  has  been  seen,  recogniton  rates  declinea  when  tne 
eperaters  cf  this  ASR  equipment  were  placed  under  tire 
stress.  An  alter native  approach  to  evaluating  the 
desiraoiiity  cf  this  technology  might  ce  tc  assess  voice 
versus  other  means  of  input  under  similar  conditions.  In 
this  study,  subjects  attempting  tc  complete  a  nearly 
irrpcssiole  tasir  (Phase  3)  nonetheless  managed  to  maintain  a 
correct  recognition  rate  cf  approximately  ninety  percent. 
With  manual  input  (e.g.,  typing),  each  characer  to  he  Input 
represents  an  opportunity  for  error:  how  this  might  he 
affected  ty  psychological  stress  is  open  to  speculation. 
When  one  considers  that  a  sixteen  character  output  string 
can  be  produced  with  one  voice  command,  it  may  well  be  that 
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voice  input  becomes  even  rrore  desirable  in  a  stressful 
environment . 

It  is  interesting  to  note  bow  quickly  the  inexperienced 
sutjects  "caught  up"  with  ana  paralleled  the  performance  of 
their  experienced  counterparts.  If  this  is  indeed  a 
generaiizable  phenomenon,  it  would  imply  that  after  some  few 
training  sessions  with  a  recognizer,  the  distinction 
vanishes.  If  so,  faced  with  a  potentially  adverse  situation 
a  commander  or  watch  officer  need  cot  worry  about  this 
experience- level  in  nis  Comma ca  Post  staffing.  Voice  input 
has  also  been  proposed  for  automated  reservation  and 
directory-assistance  scnemes.  Given  the  success  rates  for 
inexperienced  users  even  under  conditions  calculated  to 
induce  stress,  these  applications  appear  no  less  feasible. 

Lastly,  It  is  generally  acknowledged  that  a  more 
objective  method  for  assessing  psychological  stress  is 
desperately  needed.  Any  further  work  in  this  area  should 
find  an  alternative  and  hopefully  more  workable  method  of 
assessing  stress  in  subjects.  It  is  submitted  that  some 
relatively  non-intrusive  physiological  measures  of  stress  be 
considered.  Several  of  these  observable  and  quantifiable 
indicators  are  well  documented  in  existing  stress 


literature . 


APPENDIX  A 


*VOCABULABY  LIST* 

a************!***** 


#  WORD  # 


0.  ZERO 

25  . 

1.  CNE 

2e. 

2.  TWO 

27. 

2.  THREE 

28. 

4.  FOUR 

2y . 

5.  FIVE 

20. 

6.  SIX 

21. 

7.  SEVEN 

32. 

6.  EIGHT 

23. 

y.  NEGATIVE 

24. 

1(6 .  TERMINATE 

35. 

11.  SEND  MESSAGE 

26 . 

12.  ANSWER  MESSAGE 

37. 

12.  FORWARD  MESSAGE 

28. 

14.  cincpacfleit 

29. 

15.  THIRD  FLEET 

40. 

Id.  HATTIE  GROUP 

41  . 

17.  information 

42. 

16.  RETURN 

43. 

ly.  ?A  TAR 

44  . 

20.  VISUAL 

45 . 

21.  ELECTRONIC 

46. 

22.  CONTACT 

47  , 

22.  SIGHTING 

48, 

24.  SOVIET 

49 

WORD 

UNKNOWN 

POSSIBLE 

CERTAIN 

CONFIDENCE 

LAVAGE 

ASSESSMENT 

REQUEST 

INSTRUCTIONS 

READINESS 

INTENT 

ATTACK 

REPORT 

RATHBURNE 

FRCM 

COMMANDER 

SURFACE 

TIME 

ZULU 

PERIOD 

POSITION 

NUMBER 

BEARING 

DISTANCE 

UNDER 

ADVISE 
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♦SCENARIO  IISTING* 


CRT  OUTPUT 


1. 

SEND  MESSAGE 

Cm  • 

BATTLE  GROUP 

-l 

v>  ■ 

COMMANDER 

4. 

RETURN 

An 

*m  m 

INFORMATION 

6. 

THIREELEET 

7. 

CINCPACELEET 

S. 

RETURN 

y . 

ERCM 

10. 

RATHBURNE 

SEND  MSG  TO:  CV  BATTIE  GROUP 

COMMANDER 

INFO  COPT  TO:  COMTH IRDf IEET 

CINCPACELEET 

11 . 

RETURN 

FROM:  USS  RATHEURNE 

POSITION  REPORT  NUMBER  1 

12. 

POSITON 

REAEINESS  LVl  2 

POSSIBLE  ELECTRONIC  SURFACE 

CONTACT 

12. 

REPORT 

BEAR ING ; 160  DISTANCE  27 
CONFIDENCE  LVL  2 

14. 

NUMBER 

REQUEST  INSTRUCTIONS 

15. 

ONE 

QUIT. 

ie . 

RETURN 

17. 

REAEINESS 

16. 

THREE 

iy. 

RETURN 

20  . 

POSSIBLE 

54 


21.  ELECTRONIC 


22.  SURFACE 
22.  CONTACT 

24.  RETURN 

25.  BEARING 

26.  ONE 

27.  EIGHT 
26.  ZERO 

29.  DISTANCE 

20.  TWO 

21.  SEVEN 

22.  RETURN 

23.  CONFIDENCE 

34.  THREE 

35.  RETURN 

36.  REQUEST 

27.  INSTRUCTIONS 

28.  TERMINATE 


29.  FORWARD  MESSAGE 

40.  SEVEN 

41.  RETURN 

42.  INFORMATION 

43.  BATTLE  GROUP 


44.  COMMANDER 

45.  RETURN 


FORWARD  MSG  #:7 

INFO  COPT  TO:  CV  BATTLE  GROUP  COMMANDER 
FROM:  USS  RATHBURNE 

QUIT. 


46.  fROM 

47.  RATHBURNE 

48.  TERMINATE 


49.  ANSWER  MESSAGE 
£0.  TWO 

51.  FOUH 

52.  RETORN 

53.  CONTACT 
£4.  UNKNOWN 

55.  POSSIEIE 
£6.  SOVIET 
£7.  PERIOD 

56.  INTENT 
£9.  UNKNOWN 

60.  TERMINATE 


ANSWER  MSG  #:  24 
CONTACT  UNKNOWN  POSSIBLE  SO VI ET. 
^no  carriage  return  In  scenario; 
INTENT  UNKNOWN 

QUIT. 


61.  SENE  MESSAGE 

62.  BATTLE  GROUP 

63.  COMMANDER 

64.  RETURN 

65.  INFORMATION 

66.  THRIEUZET 

67.  CINCPACF LEET 

68.  RETURN 
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69.  CONTACT 


70.  SIGHTING 

71.  REPORT 

72.  NUMBER 

73.  SIX 

74.  RETURN 

76.  TIME 

76.  ONE 

77.  SEVEN 

78.  EOUR 

79.  EIGHT 

60.  ZULU 

61.  RETURN 
£2.  POSITION 

83.  ONE 

84.  ilVE 
£5.  SEVEN 

66.  TWO 

67.  ZERO 
£6.  RETURN 
69.  COMMANDER 

90.  ASSESSMENT 

91.  POSS I  ELI 


SEND  MSG  TO:  CV  BATTLE  GROUP  COMMANDER 
INFO  COPT  TO:  CCMTHIP.DFLSET  CINCPACFLEIT 
CONTACT  SIGHTING  REFORT  NUMBER  6 
TIME  1746  ZULU 
POSITION  16720 

COMMANDER  ASSESSMENT  POSSIBLE  SOVIET  SURFACE, 
(no  carriage  return  In  scenario) 

UNKNOWN 

QUIT. 


92.  SOVIET 


93.  SURFACE 
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y4.  PERIOD 


95. 

UNKNOWN 

96. 

TERMINATE 

97 . 

ANSWER  MESSAGE 

98. 

THREE 

99. 

EIGHT 

100. 

RETURN 

101. 

INFORMATION 

1012. 

C INCFACFIEET 

102. 

RETURN 

104. 

FROM 

ANSWER  MSG  #:  56 

INFO  COPY  TO:  C INCPACFLEET 

105. 

RATREURNE 

EROM:  USS  RATHBURNE 

UNKNOWN  SURFACE  CONTACT.  POSSIBLE  SOVIET 

106. 

RETURN 

UNKNOWN  RADAR  CONTACT  BEARING:  235 
DISTANCE  14 

107 . 

UNKNOWN 

NEGATIVE  VISUAL  CONTACT. 

CONFIDENCE  LVL  ASSESSMENT  6. 

106. 

SURE  ACE 

INTENT  UNKNOWN.  REQUEST  ADVISE 

129. 

CONTACT 

O';  IT. 

lie. 

PERIOD 

Ill . 

PQSSIE1E 

Ilk. 

SOVIET 

115 . 

RETURN 

114. 

UNKNOWN 

115. 

RADAR 

lie. 

CONTACT 

117.  BEARING 


119.  THREE 

120.  FIVE 

121.  RETURN 

122.  DISTANCE 
122 .  ONE 

124.  FOUR 

125.  RETURN 
125.  NEGATIVE 
127.  VISUAL 
129.  CONTACT 
129.  FSRIOD 
122.  RETURN 

131.  CONFIDENCE 

132.  ASSESSMENT 
132.  SIX 

134.  PERIOD 
135  .  RETURN 

135.  INTENT 
137.  UNKNOWN 

136 .  PERIOD 

139.  REQUEST 

140.  ADVISE 

141.  TERMINATE 


142.  SENE  MESSAGE 


142 .  BATTLE  GROUP 


144.  COMPANDER 
145  .  RETURN 

146.  INFORMATION 

147.  THIRIFLEET 
146.  RETURN 

149.  FROM 

150.  RATHBURNE 

151.  RETURN 

152.  CONTACT 
152.  REPORT 

154.  NUMBER 

155.  SIX 

156.  RETURN 

157.  CONTACT 
156.  RIGHTING 
159.  RETURN 
162 .  CONTACT 

161.  CERTAIN 

162.  SOVIET 
162.  SURFACE 

164.  RETURN 

165.  BEAPING 

166.  TWO 


SEND  MSG  TO:  CV  BATTLE  GROUP  COMMANDER 

INFO  COPY  TO:  COMTHIREHEET 

FROM:  USB  RA1HBURNE 

CONTACT  REPORT  NUMBER  6 

CONTACT  SIGHTING 

CONTACT  CERTAIN  SOVIET  SURFACE 

BEARING:  227 

DISTANCE  14 

INTENT  UNKNOWN 

QUIT. 
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167.  THREE 


*  a-*: 


166 .  SEVEN 
lets.  RETURN 
l?e.  DISTANCE 

171.  ONE 
l?*.  TOUR 

172.  RETURN  i 

i 

174.  INTENT  j 

175.  UNKNOWN 

176.  TERMINATE 


177.  SEND  MESSAGE 

178.  THIKDELEST 
17b'.  RETURN 

180.  iNECRMAl’ICN 

181.  CINCFACFI2ET 
168.  RETURN 

162.  FROM 
ie4.  RATHBURNE 
166.  RETURN 
166.  COMMANDER 
187.  ASSESSMENT 
168.  BATTLE  GROUP 
iey.  under 
lbO.  SOVIET 
lyl .  ATTACK 


SEND  MSG  TC:  CCMTH IRDFL2ET 

UikQ  COPT  TC:  CINCPACiLEET 

IROM:  USS  RaTHBURNE 

COMMANDER  ASSESSMENT  CV  BATTLE  GROUP 

(no  carriage  return  in  scenario) 

UNDER  SOVIET  ATTACK.  DAMAGE  LVL 
(no  carriage  return  in  scenario) 
UNKNOWN.  REQUEST  ADVISE  USS  RATHBURNE 
(no  carriage  return  in  scenario) 
INTENT 

QUIT. 


ei 


iy2.  PERIOD 


192.  DAMAGE 

194.  UNKNOWN 
19t.  PERIOD 
lye.  REQUEST 
197.  ADVISE 
196.  RATHEUBNE 

195.  INTENT 
2Z2.  TERMINATE 

END  SCENARIO 


62 


APPENDIX  C 


INSTRUCTIONS  TC  EXPERIMENT  PARTICIPANTS  "PHASE  1" 


The  experiment  you  viii  be  participating  la  simulates 
what  might  happen  in  a  shipboard  command  post.  You  will  be 
using  the  vocabulary  you  have  already  trained  during  the 
first  session. 

Ycu  will  oe  given  a  list  of  werds  to  speaJi  to  the 
machine:  when  recognized,  they  will  come  up  on  your  screen. 
Certain  warns  produce  a  multiple  word  output,  as  you  have 
already  seen.  These  are  an  attempt  tc  simulate  message- 
handling  utilities  such  as  are  foind  on  the  ARPANET . 

Speak  clearly  and  distinctly,  tut  don't  overenunciate. 
Nc  provision  has  been  made  to  let  ycu  back  up  and  correct 
mistakes,  misreccgnitions,  etc.,  so  jcu  must  continue  to  say 
the  word  until  it  is  recognized.  If  you  rlss  a  word  and 
don't  catch  it,  I  will  prompt  you  for  the  word  you  missed? 
please  simply  try  again  cl  the  word  I  prompt  you  with. 
Pauses  between  words  may  te  short,  tut  they  must  be 
distinct. 

Set  ycur  own  pace, don't  rush  but  don't  linger.  You  are 
not  competing  against  anyone  (Including  yourself)  for  time 
or  recognition,  so  don't  be  nervous.  No  grades  are  being 
given  out  here.  Once  you  start,  however,  please  continue  to 
the  end. 

I  will  be  glad  to  answer  any  questions  you  may  have 
about  the  conduct  of  this  phase  of  the  experiment.  Any 
questions  about  the  purpose  and  outcome  I  will  be  glad  tc 
discuss  at  great  length  once  the  actual  experimentation 
phase  is  threugn. 

Thanks  again  for  ycur  help. 
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APPENDIX  r 


INSTRUCTIONS  TO  EXPERIMENT  PABTICIFANTS  ’''PHASE  2* 


T la  1  s  is  Phase  2  of  the  experiment.  In  this  phase  you 
will  participate  in  a  Command  Pest  scenario  very  similar  in 
content  and  length  to  the  first  scenario  you  were  given. 

Tae  most  significant  difference  in  this  phase  is  the 
time  factor.  In  Phase  2  you  will  be  allotted  only  2/2  of  the 
time  you  originally  took  to  complete  the  entire  scenario.  In 
fact,  you  ray  not  he  able  to  complete  the  entire  scenario. 
It  is  to  ce  erphaslzed  that  your  primary  objective  is  tc 
complete  as  much  of  the  scenario  as  you  possiDly  can,  as 
accurately  as  possible.  In  this  command  post  situation,  the 
messages  ycu  will  be  handling  are  cf  an  urgent  nature,  and 
your  primary  motivation  is  to  get  as  mar.y  of  them  completed 
as  possible,  Yci.  are  being  scored  for  both  recognition  rats 
and  tne  percentage  of  the  scenario  ycu  complete.  Any  parts 
you  ac  not  cor-  i  lett  will  count  against  ycur  overall  score, 
unfortunately .... 

During  tne  experiment,  I  will  give  you  frequent 
reminders  of  the  time  you  nave  remaining  a t  30  second 
intervals.  The  time  I  announce  will  be  in  minutes  and 
seconds  remaining.  The  timer  will  also  furnish  ycu  with  a 
running  indication  of  time  remaining. 

Again,  attempt  to  speak  clearly  and  distinctly.  No 
provision  has  beer,  made  to  back  up  and  correct  mistakes  or 
nisrecognitions,  so  you  must  repeat  the  word  or  phrase  until 
it  is  correctly  recognized.  If  you  miss  a  word  and  don't 
eaten  it,  I  will  prompt  ycu  for  the  word  ycu  missed:  please 
try  again  on  tne  word  I  prompt  you  with. 

Let  re  re-emphasize  here  that  the  shortened  time- 
aiicwance  is  meant  to  represent  the  urgency  of  the  situation 
in  the  Command  Fost,  and  that  your  principal  goal  is  to 
finish  as  much  of  the  scenario  as  possible. 

Thanks  again  for  your  help. 
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INSTBUCTICNS  TC  EXPERIMENT  PARTICIPANTS 


•PHASE  3* 


This  Is  Phase  3  of  the  experiment.  In  this  phase  yov 
will  participate  in  a  Command  Post  scenario  very  similar  in 
content  and  length  tc  the  first  scenario  yen  were  -aiven. 

The  rrost  significant  difference  in  this  chase  is  the 
litre  factor.  In  Phase  3  you  vU.1  be  allotted  only  1/3  of  the 
tire  ycu  originally  tcok  tc  complete  the  entire  scenario.  Tr 
fact,  you  tray  net  be  able  to  complete  The  entire  scenario. 
It  is  to  te  emphasized  that  ycur  primary  objective  is  to 
complete  as  much  of  the  scenarie  as  ycu  possibly  can,  as 
accurately  as  pcssible.  In  this  command  pest  situation,  the 
messages  you  will  be  handling  are  cf  an  urgent  nature,  er.d 
ycur  primary  motivation  Is  tc  get  as  many  cf  them  completed 
as  possicle.  vou  are  being  scored  fer  both  recognition  rate 
and  the  percentage  cf  the  scenario  you  complete.  Any  parts 
ycu  dr  net  com-  plete  win  count  against  ycur  overall  score, 
unf crturately .... 

Turing  the  experiment,  I  will  give  you  frequent 
reminders  cf  the  time  y^u  have  remaining  at  3?  ceccrd 
intervals.  The  time  I  announce  will  be  in  minute?  end 
seccnas  reraining.  The  timer  will  also  furnish  yov  with  a 
running  indication  cf  time  remaining. 

Again,  attempt  to  speeic  clearly  and  distinctly.  I'o 
provision  has  been  rede  tc  tack  up  ana  correct  mistakes  cr 
mlsreccgcl  1 1  cr.s ,  sc  yov  must  repeat  the  word  cr  phrase  until 
it  is  correctly  recognized.  If  you  rriss  a  word  and  don't 
catch  it,  I  will  prerpt  ycu  for  the  verd  you  missed:  please 
try  again  cc  the  word  I  prompt  ycu  with. 

Let  me  re-emphasize  here  that  the  shortened  tire- 
allowance  is  meant  to  represent  the  urgency  of  the  situation 
in  the  Ccmmard  Fcst,  and  that  ycur  principal  meal  is  to 
finish  as  much  of  the  scenario  as  possible. 

Thanks  again  for  ycur  help. 
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UUEETIONNAIBE 

********************************  jit*************,,*,, 

SUBJECT  NUMBER . 

DATE . 

PHASE . 

Please  answer  tie  following  questions  concerning  your 
feelings  during  the  phase  of  the  experiment  you  have  J'.’St 
completed.  These  questions  pertain  ONLY  to  this  phase. 

Circle  the  response  which  best  describes  your  feelings. 

The  scale  1st 


1  =  NOT  AT  ALL 
3  =  MCDERAT3IY  OR  SCREW EAT 
5  =  VERY  MUCH  SC 


Re 

sponses  2  and 

4  are  the  mid 

points  between  the  above 

• 

CU 

ISTION:  {Circle  one  fcr  each 

question) 

During  this 

phase  of  the 

experiment,  I  felt: 

net  at  all 

moderately 

n> 

<5 

^uch  s 

1. 

nervous 

1 

2 

3  4 

P 

2. 

confused 

1 

c 

3  4 

5 

rt 

w  • 

pressured 

1 

2 

3  4 

4. 

irritated 

1 

2 

3  4 

C 

a. 

anxious 

1 

c 

3  4 

R 

sw 

*****pl£asz,  Dcd 

'i  discuss  your 

experience  or  answers 

with 

anyone 

****** 
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